-
Free, publicly-accessible full text available September 1, 2026
-
Optimizing request routing in large microservice-based applications is difficult, especially when applications span multiple geo-distributed clusters. In this paper, inspired by ideas from network traffic engineering, we propose Service Layer Traffic Engineering (SLATE), a new framework for request routing in microservices that span multiple clusters. SLATE leverages global knowledge of cluster states and multi-hop application graphs to centrally control the flow of requests in order to optimize end-to-end application latency and cost. Realizing such a system requires tackling several technical challenges unique to the service layer, such as accounting for different request traffic classes, multi-hop call trees, and application latency profiles. We identify such challenges and build a preliminary prototype that addresses some of them. Preliminary evaluations of our prototype show how SLATE outperforms the state-of-the-art global load balancing approach (used by Meta’s Service Router and Google’s Traffic Director) by up to 3.5× in average latency and reduces egress bandwidth cost by up to 11.6×.
Free, publicly-accessible full text available November 18, 2025
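The abstract describes SLATE's core decision: using global knowledge to steer each traffic class toward the cluster that jointly minimizes latency and egress cost. A toy sketch of that trade-off is below; it is not SLATE's actual algorithm, and the function name, cluster names, and `cost_weight` parameter are all hypothetical.

```python
def route(latency_ms: dict, egress_cost: dict, cost_weight: float = 1.0) -> str:
    """Pick the cluster minimizing estimated latency plus weighted egress cost.

    latency_ms and egress_cost map cluster name -> per-request estimate;
    cost_weight trades off cost against latency (hypothetical knob).
    """
    return min(latency_ms, key=lambda c: latency_ms[c] + cost_weight * egress_cost[c])
```

Raising `cost_weight` shifts traffic toward cheaper clusters even at higher latency, illustrating why a single global controller (rather than per-cluster load balancers) is needed to make this trade-off consistently.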
-
Rate enforcement is routinely employed in modern networks (e.g. ISPs rate-limiting users' traffic to the subscribed rates). In addition to correctly enforcing the desired rates, rate-limiting mechanisms must be able to support rich rate-sharing policies within each traffic aggregate (e.g. per-flow fairness, weighted fairness, and prioritization). And all of this must be done at scale to efficiently support the vast magnitude of users. There are two primary rate-limiting mechanisms -- traffic shaping (that buffers packets in queues to enforce the desired rates and policies) and traffic policing (that filters packets as per the desired rates without buffering them). Policers are lightweight and scalable, but do not support rich policy enforcement and often provide poor rate enforcement (being notoriously hard to configure). Shapers, on the other hand, achieve desired rates and policies, but at the cost of high system resource (memory and CPU) utilization which impacts scalability. In this paper, we explore whether we can get the best of both worlds -- the scalability of a policer with the rate and policy enforcement properties of a shaper. We answer this question in the affirmative with our system BC-PQP. BC-PQP augments a policer with (i) multiple phantom queues that simulate buffer occupancy using counters, and enable rich policy enforcement, and (ii) a novel burst control mechanism that enables auto-configuration of the queues for correct rate enforcement. We implement BC-PQP as a middlebox over DPDK. Our evaluation shows how it achieves the rate and policy enforcement properties close to that of a shaper with 7× higher efficiency.
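The phantom-queue idea above can be illustrated with a counter that mimics a shaper's buffer occupancy without holding any packets: the counter drains at the configured rate, and a packet is dropped when admitting it would push the simulated occupancy past the virtual buffer size. This is only a minimal sketch of the concept; the class name, parameters, and timestamp convention are assumptions, not BC-PQP's DPDK implementation.

```python
class PhantomQueue:
    """Counter that simulates a shaper queue's occupancy without buffering."""

    def __init__(self, rate_bps: float, limit_bytes: int):
        self.rate = rate_bps / 8.0   # drain rate in bytes per second
        self.limit = limit_bytes     # simulated buffer size
        self.occupancy = 0.0         # simulated bytes currently "queued"
        self.last = 0.0              # timestamp of the previous packet

    def admit(self, pkt_bytes: int, now: float) -> bool:
        """Return True to forward the packet, False to police (drop) it.

        `now` is a monotonic timestamp in seconds supplied by the caller.
        """
        # Drain the virtual queue at the configured rate since the last packet.
        self.occupancy = max(0.0, self.occupancy - (now - self.last) * self.rate)
        self.last = now
        if self.occupancy + pkt_bytes > self.limit:
            return False             # virtual queue full: policer drops
        self.occupancy += pkt_bytes  # packet is forwarded immediately
        return True
```

Because only a counter and a timestamp are kept per queue, many such queues can coexist cheaply, which is what lets a policer approximate per-flow policies that normally require real per-flow buffers.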
-
It is challenging to meet the bandwidth and latency requirements of interactive real-time applications (e.g., virtual reality, cloud gaming, etc.) on time-varying 5G cellular links. Today’s feedback-based congestion controllers try to match the sending rate at the endhost with the estimated network capacity. However, such controllers cannot precisely estimate the cellular link capacity that changes at timescales smaller than the feedback delay. We instead propose a different approach for controlling congestion on 5G links. We send real-time data streams using an imprecise controller (that errs on the side of overestimating network capacity) to ensure high throughput, and then adapt the transmitted content by dropping appropriate packets in the cellular base stations to match the actual capacity and minimize delay. We build a system called Octopus to realize this approach. Octopus provides parameterized primitives that applications at the endhost can configure differently to express different content adaptation policies. Octopus transport encodes the corresponding app-specified parameters in packet header fields, which the basestation logic can parse to execute the desired dropping behavior. Our evaluation shows how real-time applications involving standard and volumetric videos can be designed to exploit Octopus, and achieve 1.5–18× better performance than state-of-the-art schemes.
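The key mechanism in the abstract is encoding app-specified parameters in packet header fields that the base station parses to decide what to drop. A minimal sketch of that pattern is below; the 4-byte header layout, field names, and drop rule are hypothetical, not Octopus's actual wire format.

```python
import struct

# Hypothetical header: stream id, content layer, and a drop-precedence value
# the base station compares against a cutoff derived from current capacity.
HDR_FMT = "!BBH"  # uint8 stream_id, uint8 layer, uint16 precedence (big-endian)

def encode_header(stream_id: int, layer: int, precedence: int) -> bytes:
    """Endhost side: stamp the app's adaptation parameters into the header."""
    return struct.pack(HDR_FMT, stream_id, layer, precedence)

def should_drop(header: bytes, precedence_cutoff: int) -> bool:
    """Base-station side: drop packets whose precedence exceeds the cutoff
    implied by the currently sustainable link rate."""
    _, _, precedence = struct.unpack(HDR_FMT, header)
    return precedence > precedence_cutoff
```

The point of the split is that the endhost expresses *policy* (which packets matter least) once, in the header, while the base station applies it at the timescale of actual capacity changes, faster than any end-to-end feedback loop could.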
-
Recent years have seen a slew of papers on datacenter congestion control mechanisms. In this editorial, we ask whether the bulk of this research is needed for the common case where congestion control involves hosts responding to simple congestion signals from the network and the performance goal is reducing some average measure of Flow Completion Time. We raise this question because we find that, out of all the possible variations one could make in congestion control algorithms, the most essential feature is the switch scheduling algorithm. More specifically, we find that congestion control mechanisms that use Shortest-Remaining-Processing-Time (SRPT) achieve superior performance as long as the rate-setting algorithm at the host is reasonable. We further find that while SRPT’s performance is quite robust to host behaviors, the performance of schemes that use scheduling algorithms like FIFO or Fair Queuing depends far more crucially on the rate-setting algorithm, and is typically worse than what can be achieved with SRPT. Given these findings, we then ask whether it is practical to realize SRPT in switches without requiring custom hardware. We observe that approximate and deployable SRPT (ADS) designs exist, which leverage the small number of priority queues supported in almost all commodity switches, and require only software changes in the host and the switches. Our evaluations with one very simple ADS design show that it can achieve performance close to true SRPT and is significantly better than FIFO. Thus, the answer to our basic question – whether the bulk of recent research on datacenter congestion control algorithms is needed for the common case – is no.
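An ADS design of the kind the editorial describes approximates SRPT by bucketing a flow's remaining bytes into the handful of strict-priority queues that commodity switches expose. A minimal sketch of that mapping follows; the thresholds and function name are illustrative assumptions, not the specific ADS design evaluated in the editorial.

```python
# Hypothetical byte cutoffs separating four switch priority queues.
THRESHOLDS = [10_000, 100_000, 1_000_000]

def priority_for(remaining_bytes: int) -> int:
    """Map remaining flow size to a queue index; 0 is the highest priority.

    Flows with the least remaining work land in the highest-priority queue,
    approximating SRPT with only len(THRESHOLDS) + 1 queues.
    """
    for level, cutoff in enumerate(THRESHOLDS):
        if remaining_bytes <= cutoff:
            return level
    return len(THRESHOLDS)  # largest flows share the lowest-priority queue
```

The host stamps the resulting priority into a packet field (e.g., DSCP) and the switch simply serves queues in strict priority order, which is why only software changes are needed on either side.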